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Gene encoding thanatin, vector containing it and 
disease-resistant transformed plants obtained 



The subject of the present invention is a DNA 
5 sequence encoding thanatin, a vector containing it for 
the transformation of a host organism, and the method 
of transforming the said organism. 

The invention relates more particularly to 
the transformation of plant cells and plants, the 
10 thanatin produced by the transformed plants conferring 
on them resistance to diseases, in particular of fungal 
origin . 

An increasing need already exists for making 
plants resistant against diseases, in particular fungal 
15 diseases, in order to reduce, or even eliminate, the 
need for treatment with antifungal protection products, 
with a view to protecting the environment. One means of 
increasing this resistance to diseases consists in 
transforming the plants so that they produce substances 
20 capable of providing their defence against these 
diseases . 

Various substances of natural origin, in 
particular peptides, are known which exhibit 
bactericidal or fungicidal properties, in particular 
25 against the fungi responsible for plant di 

However, the problem consists in finding such 



seases . 



substances which not only can be produced by 
transformed plants, but can still preserve their 
bactericidal or fungicidal properties and confer them 
on the said plants. For the purposes of the present 
5 invention, bactericidal or fungicidal is understood to 
mean the actual bactericidal or fungicidal properties 
and the bacteriostatic and fungistatic properties. 

Thanatin is a peptide produced by bacterial 
induction on adult Psodius sp, preferably 
10 maculiventrls. Its preparation by bacterial induction 
is described in patent application FR 2,733,237, as 
well as its antifungal and antibacterial properties in 



vi tro. 



After having first identified the thanatin 
15 gene, it was also found that it could be inserted into 
a host organism, in particular a plant, in order to 
express the thanatin and confer on the said host 
organism properties of resistance to fungal diseases 
and to diseases of bacterial origin, providing a 
20 particularly advantageous solution to the problem 
stated above . 

The subject of the invention is therefore 
first a nucleic acid fragment encoding thanatin, a 
chimeric gene comprising the said fragment encoding 
25 thanatin as well as heterologous regulatory elements at 
the 5' and 3' positions which can function in a host 
organism, in particular in plants, and a vector for 
transforming the host organisms containing this 



chimeric gene, and the transformed host organism. It 
also relates to a transformed plant cell containing at 
least one nucleic acid fragment encoding thanatin and a 
disease-resistant plant containing the said cell, in 
5 particular regenerated from this cell, it finally 

relates to a method of transforming plants to make them 
resistant to diseases, in which method a gene encoding 
thanatin is inserted by means of an appropriate vector. 

Thanatin is understood to mean according to 
10 the invention any peptide comprising essentially the 
peptide sequence of 11 amino acids which is described 
in patent application FR 2,733,237, as well as the 
equivalent homologous sequences in which certain amino 
acids are replaced by different but equivalent amino 
15 acids at sites which do not induce substantial 
modification of the antifungal or antibacterial 
activity of the said homologous sequence. Peptide 
sequence comprising essentially the peptide sequence 
described in patent application FR 2,733,237 is 
20 understood to mean not only the sequence defined by the 
sequence identifier No. 1 (SEQ ID NO 1), but also such 
a sequence comprising at either of its ends, or at 
both, peptide residues necessary for its expression and 
targeting in a host organism, in particular a plant 
25 cell or a plant, 

Thanatin is a peptide of formula (I) : 
Xaa-Ile He Tyr Cys Asn Arg Arg Thr Gly Lys Cys-Xab 

(I) 



in which: 



Xaa is NH2 or a variable residue having a 
sequence comprising from 1 to 10 amino acids, and 

Xab is OH or a variable residue having a 
5 sequence comprising from 0 to 5 amino acids. 

Advantageously, when Xaa comprises at least 
one amino acid, the latter is one of the 20 base amino 
acids and more particularly chosen from the group 
comprising Gly, Ser, Lys, Pro and Val. When Xab 
10 comprises at least one amino acid, the latter is one of 
the 20 base amino acids and more particularly chosen 
from the group comprising Gin, Arg and Met. 

According to a preferred embodiment of the 
invention, the two cysteine residues of the peptide of 
15 formula (I) form an intramolecular disulphide bridge. 

The present invention therefore relates first 
to a nucleic acid, in particular a DNA, fragment 
encoding the thanatin defined above. It may be, 
according to the invention, a fragment isolated from 
20 Psodius sp, preferably maculiventris, or alternatively 
a derived fragment, suitable for the expression of 
thanatin in the host organism where the peptide will be 
expressed. The nucleic acid fragment may be obtained 
using standard methods of isolation and purification, 
25 or alternatively by synthesis according to the 

customary techniques of successive hybridizations of 
synthetic oligonucleotides. These techniques are in 
particular described by Ausubel et al. 



According to the present invention, "nucleic 
acid fragment" is understood to mean a nucleotide 
sequence which may be of the DNA or RNA type, 
preferably of the DNA, in particular cDNA, especially 
5 double-stranded, type. 

According to one embodiment of the invention, 
the nucleic acid fragment encoding thanatin comprises 
the DNA sequence described by the sequence identifier 
No. 1 (SEQ ID NO 1), a homologous sequence or a 
10 sequence complementary to the said sequence. 

Advantageously, the nucleic acid fragment 
according to the invention comprises the DNA sequence 
described by the sequence identifier No. 2 (SEQ ID NO 
2), a homologous sequence or a sequence complementary 
15 to the said sequence. 

"Homologous" is understood to mean according 
to the invention a nucleic acid fragment having one or 
more sequence modifications relative to the nucleotide 
sequence described by the sequence identifier No. 1 or 
20 No. 2 and encoding thanatin. These modifications may be 
obtained according to the customary mutation 
techniques, or alternatively by choosing the synthetic 
oligonucleotides used in the preparation of the said 
sequence by hybridization. Given the multiple 
25 combinations of nucleic acids which may lead to the 
expression of the same amino acid, the differences 
between the reference sequence described by the 
sequence identifier No. 1 or No. 2 and the homologue 



may be great, especially since a DNA fragment of less 
than 100 nucleic acids in size, which can be produced 
by synthesis, is involved. Advantageously, the degree 
of homology will be at least 70% relative to the 
5 reference sequence, preferably at least 80%, more 
preferably at least 90%. These modifications are 
generally neutral, that is to say that they do not 
affect the primary sequence of the resulting thanatin. 

The present invention also relates to a 
10 chimeric gene (or an expression cassette) comprising a 
coding sequence as well as heterologous regulatory 
elements at the 5' and 3' positions which can function 
in a host organism, in particular plant cells or 
plants, the coding sequence comprising at least one DNA 
15 fragment encoding thanatin as defined above. 

Host organism is understood to mean any 
higher or lower mono- or pluricellular organism into 
which the chimeric gene according to the invention may 
be introduced, for the production of thanatin. It 
20 consists of in particular bacteria, for example 
E. coli, yeasts, in particular of the genera 
Saccharomyces or Kluyveromyces , or preferably plant 
cells and plants. 

"Plant cell" is understood to mean according 
25 to the invention any ceil derived from a plant and 
which may constitute undifferentiated tissues such as 
calli, and differentiated tissues such as embryos, 
plant portions, plants or seeds. 



"Plant" is understood to mean according to 
the invention any differentiated multicellular organism 
capable of photosynthesis, in particular 
monocotyledones or dicotyledones, more particularly 
5 cultivated plants intended or otherwise as animal feed 
or for human consumption, such as maize, wheat, colza, 
soya bean, rice, sugar cane, beet, tobacco, cotton and 



the like. 



The regulatory elements necessary for the 
10 expression of the DNA fragment encoding thanatin are 
well known to persons skilled in the art depending on 
the host organism. They comprise in particular promoter 
sequences, transcription enhancers, transit peptides, 
terminator sequences, including start and stop codonl. 
15 The means and methods for identifying and selecting the 
regulatory elements are well known to persons skilled 
in the art. 

The nucleic acid fragment according to the 
invention may also comprise a nucleic acid sequence 
20 fused in 5' and/or 3' to the sequence encoding thanatin, 
so as to Obtain, a "protein-thanatin" fusion protein, 
whose Cleavage by the enzymatic systems of the host 
organism allows the release of thanatin. This thanatin- 

fused protein may be a sion^i r^^r.^--^ 

y a Signal peptide or a transit 

25 peptide which makes it possible to control and orient 
the production of thanatin in a specific manner in a 
part of the host organism, such as for example the 
cytoplasm, the cell .embrane, or in the case of plants 



8 



in a particular type of tissue or in the extracellular 
matrix. 

According to one embodiment, the transit 
peptide may be a signal for chloroplast or 
5 mitochondrial addressing, which transit peptide is then 
cleaved in the chloroplast or the mitochondria. 

According to another embodiment of the 
invention, the signal peptide may be an N-terminal 
signal or "prepeptide", optionally in combination with 
10 a signal responsible for retaining the protein in the 
endoplasmic reticulum, or a peptide for vacuolar 
addressing or "propeptide". The endoplasmic reticulum 
is the site where operations of maturation of the 
protein produced, such as for example the cleavage of 
15 the signal peptide, are carried out by the "cellular 
machinery" . 

The invention relates more particularly to 
the transformation of plants. As regulatory promoter 
sequence in plants, there may be used any promoter 
20 sequence of a gene which is expressed naturally in 

plants, in particular a promoter of bacterial, viral or 
plant origin such as, for example, that of a gene for 
the small subunit of ribulose biscarboxylase (RuBisCO) 
or of a plant virus gene, for example that of 
25 cauliflower mosaic (CAMV 19S or 35S) , or a promoter 
inducible by pathogens such as tobacco PR-ia or 
asparagus AoPRT-L, it being possible for any known 
suitable promoter to be used. Preferably, a regulatory 



promoter sequence is used which promotes the 
overexpression of the coding sequence constitutively or 
inducibly by a pathogen attack, such as for example 
that comprising at least one histone promoter as 
5 described in application EP 0,507,698. 

According to the invention, it is also 
possible to use, in combination with the regulatory 
promoter sequence, other regulatory sequences which are 
situated between the promoter and the coding sequence, 
10 such as transcription enhancers such as for example the 
tobacco mosaic virus (TMV) translation enhancer 
described in application WO 87/07644, or the tobacco 
etch virus (TEV) translation enhancer described by 
Carrington & Freed, or transit peptides, either single 
15 or double, and in this case optionally separated by an 
intermediate sequence, that is to say comprising, in 
the direction of transcription, a sequence encoding a 
transit peptide of a plant gene encoding a plastid 
localization enzyme, a portion of sequence of the N- 
20 terminal mature portion of a plant gene encoding a 
plastid localization enzyme, and then a sequence 
encoding a second transit peptide of a plant gene 
encoding a plastid localization enzyme consisting of a 
portion Of sequence of the N-terminal mature portion of 

25 a plant gene encoding a plastid ]orAii, = *.' 

° f-Ldstia localization enzyme, as 

described in application EP 0,508,909. As transit 
peptide, there may be mentioned the signal peptide of 
the tobacco PR-la gene described by Cornelissen et al.. 
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represented with its coding sequence by the sequence 
identifier No 3. 

The sequence encoding the fusion protein 
signal peptide PR-la-thanatin and this fusion protein 
5 also form part of the present invention. This sequence 
is in particular described by the sequence identifier 
No. 5, more particularly the coding part of this 
sequence, corresponding to bases 12 to 164. 

As regulatory terminator or polyadenylation 
10 sequence, there may be used any corresponding sequence 
of bacterial origin, such as for example the nos 
terminator from Agrobacterium tumefaciens, or 
alternatively of plant origin, such as for example a 
histone terminator as described in application 
15 EP 0,633,317. 

According to the present invention, the 
chimeric gene may also comprise a selectable marker 
suitable for the transformed host organism. Such 
selectable markers are well known to persons skilled in 
20 the art. They may be a gene for resistance to 

antibiotics, such as penicillin, or alternatively a 
gene for tolerance of herbicides for plants. 

The present invention also relates to a 
cloning or expression vector for the transformation of 
25 a host organism containing at least one chimeric gene 
as defined above. This vector comprises, in addition to 
the above chimeric gene, at least one replication 
origin. This vector may consist of a plasmid, a cosmid. 
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a bacteriophage or a virus, transformed by the 
introduction of the chimeric gene according to the 
invention. Such transformation vectors, depending on 
the host organism to be transformed, are well known to 
5 persons skilled in the art and are widely described in 
the literature. 

For the transformation of plant cells or 
plants, they may consist. in particular of a virus which 
may be used for the transformation of developed plants 
10 and containing, in addition, its own elements for 

replication and expression. Preferably, the vector for 
transforming plant cells or plants according to the 
invention is a plasmid. 

The subject of the invention is also a method 
15 of transforming host organisms, in particular plant 
cells, by integration of at least one nucleic acid 
fragment or a chimeric gene as defined above, which 
transformation may be obtained by any known appropriate 
means widely described in the specialist literature and 
20 in particular the references cited in the present 

application, more particularly by the vector according 
to the invention. 

A series of methods consists in bombarding 
cells or protoplasts with particles to which the DNA 
25 sequences are attached. Another series of methods 
consists in using, as means of transferring into the 
plant, a chimeric gene inserted into a Ti plasmid from 



12 

Agrobacterium tumefaciens or an Ri plasmid from 
Agrobacterium rhizogenes . 

Other methods may be used, such as 
microinjection or electroporation or alternatively 
5 direct precipitation by means of PEG. 

Persons skilled in the art will make the 
choice of the appropriate method depending on the 
nature of the host organism, in particular the plant 
cell or the plant. 
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15 



The subject of the present invention is also 
the host organisms, in particular plant cells or 
plants, transformed and containing an effective 
quantity of a chimeric gene comprising a sequence 
encoding the thanatin defined above. 

The subject of the present invention is also 
the plants containing transformed cells, in particular 
the plants regenerated from transformed cells. The 
regeneration is obtained by any appropriate method 
which depends on the nature of the species, as for 
20 example described in the references above. 

For the methods of transforming plant cells 
and of regenerating plants, the following patents and 
patent applications may be mentioned: US 4,459,355, 
US 4,536,475, US 5,464,763, US 5,177,010, US 5,187,073, 
.25 EP 267,159, EP 604 662, EP 672 752, US 4,945,050, 

US 5,036,006, US 5,100,792, US 5,371,014, US 5,478,744, 
US 5,179,022, US 5,565,346, US 5,484,956, US 5,508,468, 
US 5,538,877, US 5,554,798, US 5,489,520, US 5,510,318, 
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US 5,204,253, US 5,405,765, EP 442,174, EP 486,233, 
EP 486,234, EP 539,563, EP 674,725, WO 91/02071 and 
WO 95/06128. 

The present invention also relates to the 
5 transformed plants derived from the cultivation and/or 
crossing of the above regenerated plants, as well as 
the seeds of transformed plants. 

The plants thus transformed are resistant to 
certain diseases, in particular to certain fungal or 
10 bacterial diseases. Consequently, the DNA sequence 
encoding thanatin may be integrated with the main 
objective of producing plants resistant to the said 
diseases, thanatin being effective against fungal 
diseases such as those caused by Cercospora, in 
15 particular Cercospora beticola, Cladosporium, in 
particular Cladosporium herbarum, Fusarium, in 
particular Fusarium culmorum or Fusarium graminearum or 
by Phytophthora, in particular Phytophthora cinnamomi. 

The chimeric gene may also advantageously 
20 comprise at least one selectable marker, such as one or 
more herbicide tolerance genes. 

The DNA sequence encoding thanatin may also 
be integrated as a selectable marker during the 
transformation of plants with other sequences encoding 
25 other peptides or proteins of interest such as, for 
example, herbicide tolerance genes. 

Such herbicide tolerance genes are well known 
to a person skilled in the art and are in particular 



described in patent applications EP 115 673, 

WO 87/04181, EP 337 899, WO 96/38567 or WO 97/04103. 

Of course, the transformed cells and plants 
according to the invention may comprise, in addition to 
5 the sequence encoding thanatin, other heterologous 

sequences encoding other additional peptides capable of 
conferring on the plant resistance to other diseases of 
bacterial or fungal origin. 

The other sequences may be integrated by 
LO means of the same vector comprising a chimeric gene, 
which comprises a first sequence encoding thanatin and 
at least one other sequence encoding another peptide or 
protein of interest. 

They may also be integrated by means of 
5 another vector comprising at least the said other 

sequence, according to the customary techniques defined 
above . 

The plants according to the invention may 
also be obtained by crossing parents, one carrying the 
) gene according to the invention encoding thanatin, the 
other carrying a gene encoding at least one other 
peptide or protein of interest. 

Among the sequences encoding other antifungal 
peptides, there may be mentioned that encoding 
drosomycin, which is described in patent application FR 
•2,725,992 and by Fehlbaum et al. (1994), and in the 
unpublished patent application FR 97 09115 filed on 
24 July 1997, or that encoding androctonin described in 
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patent application FR 2,745,004 and in unpublished 
patent application FR 97 10362 filed on 20 August 1997. 

The examples below make it possible to 
illustrate the invention, the preparation of the 
5 sequence encoding thanatin, of the chimeric gene, of 
the integration vector and of the transformed plants. 
Figures 1 to 5 in the annex describe the schematic 
structures of some plasmids prepared for the 
construction of chimeric genes. In these figures, the 
10 different restriction sites are marked in italics. 

Example 1; Construct ion of the chimeric genes 

All the techniques used below are standard 
laboratory techniques. The detailed protocols of these 
15 techniques are in particular described in Ausubel et 
al. 
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pRPa-MD-P : Creation of a plasinid containing the signal 
peptide of the tobacco PR- la gene 

The two complementary synthetic 
oligonucleotides Oligo 1 and Oligo 2 below are 
hybridized at 65°C for 5 minutes and then by slowly 
reducing the temperature to SO^C for 30'. 



25 Oligo 1: 



GCGTCGACGC GATGGGTTTC GTGCTTTTCT CTCAGCTTCC 
ATCTTTCCTT CTTGTGTCTA CTCTTCTTCT TTTCC 3' 
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Oligo 2: 5' TCGCCGGCAC GGCAAGAGTA AGAGATCACA AGGAAAAGAA 

GAAGAGTAGA CACAAGAAGG AAAGATGGAA GC 3' 

After hybridization between Oligo 1 and Oligo 
5 2, the DNA remaining single-stranded serves as a 

template for the Klenow fragment of polymerase I of E. 
con (under the standard conditions recommended by the 
manufacturer (New England Biolabs) ) for the creation of 

m 

the double-stranded oligonucleotide starting from the 3' 
10 end of each oligo. The double-stranded oligonucleotide 
obtained is then digested with the restriction enzymes 
Sacll and Nael and cloned into the plasmid pBS II SK{-) 
(Stratagene) digested with the same restriction 
enzymes. A clone is then obtained comprising the region 
15 encoding the signal peptide of the tobacco PR-la gene 
(SEQ ID NO 3) . 



pRPA-PS-PRla-than: Creation of a sequence encoding 
thanatin fused to the signal peptide PR-la without a 
20 non transcribed region in 3' 

The two synthetic oligonucleotides with 
complementary sequences Oligo 3 and Oligo 4 based on 
the operating conditions described for pRPA-MD-P. 



25 Oligo 3: 5' 



GGTTCCAAGA AGCCAGTGCC AATCATCTAC TGCAACAGGA CG 3' 
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CCGGATCCGT CGACACGTTC GCCTCGCCGA GCTCACATCC 
TCTGGCACTT ACCAGTCCTC CTGTTGCAGT AGATGATTGG 
CACTGGC 3' 



5 After hybridization between Oligo 3 and Oligo 

4, the DNA remaining single-stranded serves as a 
template for the Klenow fragment of polymerase I of E. 
coli (under the standard conditions recommended by the 
manufacturer (New England Biolabs) ) for the creation of 
10 the double-stranded oligonucleotide starting from the 3' 
end of each oligo. This double-stranded oligonucleotide 
containing the coding part of thanatin (SEQ id NO 1) is 
then directly cloned into the plasmid pRPA-MD-P which 
has been digested with the restriction enzyme Nael. The 
15 correct orientation of the clone obtained is checked by 
sequencing. A clone is then obtained comprising the 
region encoding the fusion protein PR-la-thanatin 
situated between the Ncol restriction sites at the N- 
terminal end and the Seal, SacII and BamHI restriction 
20 sites at the C-terminal end (SEQ ID NO 4) . 

pRPA-RD-229 : Creation of an expression vector in plants 
comprising the sequence encoding the fusion protein PR- 
la-thanatin 



25 



The plasmid pRTL-2 GUS, derived from the 
plasmid pUC-19, was obtained from Dr Jim Carrington 
(Texas A&M University, not described) . This plamid, 
whose schematic structure is represented in Figure 1, 
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contains the double CaMV 35S promoter isolated from the 
cauliflower mosaic virus (CaMV 2x35S promoter, Odell et 
al., 1985) which directs the expression of an RNA 
containing [lacuna] tobacco etch virus 5' untranslated 
5 sequence (TEV 5' UTR; Carrington & Freed, 1990), the 
E.coli P-glucorinidase gene (GUS; Jefferson et al., 
1987) followed by the polyadenylation site of the CaMV 
35S RNA (CaMV polyA; Odell et al . , 1985). 

The plasmid pRTL-2 GUS is digested with the 
10 restriction enzymes Ncol and BamHI, and the large DNA 
fragment is purified. The plasmid pRPA-PS-PRla-than is 
digested with the restriction enzymes Ncol and BamHI, 
and the small DNA fragment containing the region 
encoding the fusion protein PR-la-thanatin is purified. 
15 The two DNA fragments purified are then ligated 

together in an expression cassette in plants which 
synthesizes a PR-la-thanatin fusion protein. The 
schematic structure of this expression cassette is 
represented in Figure 2. "PR-la-thanatin" represents 
20 the coding region for the fusion protein PR-la-thanatin 
Of PRPA-RD-230. The thanatin is transported to the 
extracellular matrix of the plant by the action of the 
Signal peptide PR-la. 

25 pRPA-RD-195 : Creation of a plasmid containing a 
modified multiple cloning site 

The plasmid pRPA-RD-195 is a plasmid derived 
from pUC-19 which contains a modified multiple cloning 



site. The complementary synthetic oligonucleotides 
Oligo 5 and Oligo 6 below are hybridized and made 
double-stranded according to the procedure described 
for pRPA-MD-P. 



Oligo 5: 5' 



AGGGCCCCCT AGGGTTTAAA CGGCCAGTCA GGCCGAATTC 
GAGCTCGGTA CCCGGGGATC CTCTAGAGTC GACCTGCAGG 
CATGC 3' 



.0 Oligo 6: 5' 



CCCTGAACCA GGCTCGAGGG CGCGCCTTAA TTAAAAGCTT 
GCATGCCTGC AGGTCGACTC TAGAGG 3' 



The double-Stranded oligonucleotide obtained 
is then ligated into pUC-19 which has been previously 

5 digested with the restriction enzymes EcoRI and Hindlll 
and made blunt ended using the Klenow fragment of DNA 
polymerase I of E. coli. A vector is obtained 
containing multiple cloning sites in' order to 
facilitate the introduction of expression cassettes 

) into an Agrobacterium tumefaciens vector plasmid. The 
schematic structure of this multiple cloning site is 
represented in Figure 3. 



pRPA-RD-232 : Introduction of the PR-la-thanatin 
expression cassette of pRPA-RD-229 into pRPA-RD-195 

The plasmid pRPA-RD-230 is digested with the 
restriction enzyme Hindlll. The DNA fragment containing 
the PR-la-thanatin expression cassette is purified. The 
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purified fragment is then ligated into pRPA-RP-195 
which has previously been digested with the restriction 
enzyme Hindlll and dephosphorylated with calf 
intestinal phosphatase. 



pRPA-RD-174 : Plasmid derived from pRPA-BL-150A 
(EP 0,508,909) containing the bromoxynil tolerance gene 
of pRPA-BL-237 (EP 0,508,909) 

The bromoxynil tolerance gene is isolated 
.0 from pRPA-BL-237 by a PGR gene amplification. The 

fragment obtained is blunt-ended and is cloned into the 
EcoRI site of pRPA-BL-lSOA which has been made blunt- 
ended by the action of Klenow polymerase under standard 
conditions. An Agrobacterium tumefaciens vector is 
5 obtained which contains the bromoxynil tolerance gene 
close to its right border, a kanamycin tolerance gene 
close to its left border and a multiple cloning site 
between these two genes. 

The schematic structure of pRPA-RD-174 is 
) represented in Figure 4. In this figure, "nos" 
represents the Agrobacterium tumefaciens nopaline 
synthase polyadenylation site (Sevan et al . , 1983), 
"NOS pro" represents the Agrobacterium tumefaciens 
nopaline synthase promoter (Bevan et al., 1983), "NPT 
II" represents the neomycin phosphotransferase gene of 
the E. coli Tn5 transposon (Rothstein et al., 1981), 
"35S pro" represents the 35S promoter isolated from the 
cauliflower mosaic virus (Odell et al., 1985), "SRX" 
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5 



represents the nitrilase gene isolated from K. ozaenae 
(Stalker et al., 1988), and "RB" and "LB" represent 
respectively the right and left borders of the sequence 
of an Agrohacterium tumefaciens Ti plasmid. 

pRPA-RD-184 : Addition of a new unique restriction site 
into pRFA-RD-174 

The complementary synthetic oligonucleotides 
Oligo 7 and Oligo 8 below are hybridized and made 
10 double-stranded according to the procedure described 
for pRPA-MD-P. 



Oligo 7: 5' 
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CCGGCCAGTC AGGCCACACT TAATTAAGTT TAAACGCGGC 
CCCGGCGCGC CTAGGTGTGT GCTCGAGGGC CCAACCTCAG 
TACCTGGTTC AGG 3' 



Oligo 8: 5' 



CCGGCCTGAA CCAGGTACTG AGGTTGGGCC CTCGAGCACA 
CACCTAGGCG CGCCGGGGCC GCGTTTAAAC TTAATTAAGT 
GTGGCCTGAC TGG 3' 
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The double-stranded oligonucleotide 
hybridized (95 base pairs) is purified after separation 
on agarose gel (3% Nusieve, FMC) . The plasmid pRPA-RD- 
174 is digested with the restriction enzyme Xmal, and 
25 the large DNA fragment is purified. The two DNA 
fragments obtained are then ligated. 

A plasmid derived from pRPA-RD-174 is 
obtained comprising other restriction sites between the 
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bromoxynil tolerance gene and the selectable marker 
kanamycin gene. 

The schematic structure of the plasmid pRPA- 
RD-184 is represented in Figure 5 where the terms 
5 "nos", "NPT II", "NOS pro", "35S pro", "BRX gene", "RB" 
and "LB" have the same meaning as in Figure 4. 

pRPA-RD-235 : Creation of an Agrobacteriwn 
vector containing the gene construct encoding thanatin 
10 directed towards the extracellular matrix 

The plasmid pRPA-RD-232 is digested with the 
restriction enzymes Pmel and AscI, and the DNA fragment 
containing the gene for PR-la-thanatin is purified. The 
plasmid pRPA-RD-184 is digested with the same 
15 restriction enzymes. The DNA fragment containing the 
PR-la-thanatin expression cassette is then ligated into 
pRPA-RD-184. An Agrobacterivm tumefaciens vector is 
thus obtained containing the sequence encoding the 
fusion protein PR-la-thanatin which leads to the 
20 expression of thanatin in the plant's extracellular 
matrix. 
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Example 2 : Herbicide tolerance of transformed tobaccos 
2-1 - Transformation 

The vector pRPA-RD-235 is introduced into the 
Agrobacterium tumefaciens EHAlOl strain (Hood et al., 
1987) carrying the cosmid pTVK29l (Komari et al.. 
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1986). The transformation technique is based on the 
procedure of Horsch et ai. (1985). 
2 . 2 - Regeneration 

The regeneration of the tobacco PBD6 (origin 
5 SEITA France) from foliar explants is carried out on a 
Murashige and Skoog (MS) basal medium comprising 30 g/1 
of sucrose as well as 200 ng/ml of kanamycin. The 
foliar explants are removed from plants cultivated in a 
greenhouse or in vitro and regenerated according to the 
10 foliar disc technique (Horsh et al., 1985) in three 

successive stages: the first comprises the induction of 
shoots on a medium supplemented with 30 g/i of sucrose 
containing 0.05 mg/1 of naphthylacetic acid (NAA) and 
2 mg/1 of benzylaminopurine (BAP) for 15 days. The 
15 shoots formed during this stage are then developed for 
10 days by cultivating on an MS medium supplemented 
with 30 g/1 of sucrose but containing no hormones. 
Next, the developed shoots are removed and they are 
cultivated on an MS rooting medium with half the 
20 content of salts, vitamins and sugar and containing no 
hormone. After about 15 days, the rooted shoots are 
transferred into the soil. 
Li2 - Bromoxynil tolerance 

Twenty transformed plants were regenerated 
25 and transferred into a greenhouse for the pRPA-RD-235 
construct. These plants were then treated in a 
greenhouse at the 5-leaf stage with an aqueous 



suspension of Pardner corresponding to 0.2 kg of 
bromoxynil active material per hectare. 

All the plants showing complete tolerance to 
bromoxynil are then used in various experiments which 
5 show that the expression of thanatin by the transformed 
plants makes them resistant to fungal attacks. 



REFERENCES 

F.A. Ausubel et al. (Ed. Greene). Current Protocols in 
0 Molecular Biology. Publ . Wiley s Sons. 

M. Bevan et al. (1983). Nuc. Acids Res. 11: 369-385. 

Carrington and Freed (1990). J. Virol. 64: 1590-1597. 

Ehret-Sabatier et al. (1996). The Journal of Biological 

Chemistry, 271, 47, 29537-29544. 
5 Horsch et al. (1985). Science 227: 1229-1231. 
Jefferson et al. (1987). EMBO J. 6: 3901-3907. 
Komari et al, (1986). J. Bacterid. 166: 88-94. 
Rothstein et al. (1981). Cold Spring Harb. Symp. Quant. 
Biol. 45: 99-105. 

Stalker et al. (1988). J. Biol. Chem. 263: 6310-6314. 
J.T. Odell et al. (1985). Nature 313: 810-812. 



wo 99/24594 



25 



PCT/FR98/02375 



CLAIMS 



1. Nucleic acid fragment, characterized in 
that it comprises a nucleic acid sequence encoding 

5 thanatin. 

2. Nucleic acid fragment according to claim 
1, characterized in that it is a DNA-type nucleotide 
sequence . 

3. Nucleic acid fragment according to claim 
0 2, characterized in that the DNA-type nucleotide 

sequence comprises the DNA sequence described by the 
sequence identifier No. 1 (SEQ ID NO 1), a homologous 
sequence or a sequence complementary to the said 
sequence . 

5 4. Nucleic acid fragment according to claim 

3, characterized in that the DNA-type nucleotide 
sequence comprises the DNA sequence described by the 
sequence identifier No. 2 (SEQ ID NO 2), a homologous 
sequence or a sequence complementary to the said 
sequence. 

5. Nucleic acid fragment according to one 
of claims 1 to 4, characterized in that it comprises a 
nucleic acid sequence fused in 5' and/or in 3' to the 
sequence encoding thanatin, so as to obtain a "protein- 
thanatin" fusion protein. 



6. Nucleic acid fragment according to claim 
5, characterized in that the protein is a signal 
peptide or a transit peptide. 

7. Nucleic acid fragment according to claim 
5 6, characterized in that the signal peptide is the 

signal peptide of the tobacco PR-la gene. 

8. Nucleic acid fragment according to claim 

1, characterized in that .it comprises the DNA sequence 

described by the sequence identifier No. 5 (SEQ ID NO 

0 5), a homologous sequence or a sequence complementary 
to the said sequence. 

9. Nucleic acid fragment according to claim 
8, characterized in that it comprises the coding part 
of SEQ ID NO 5, corresponding to bases 12 to 164. 

^ 10. Fusion protein "protein-thanatin" , 

characterized in that the protein is a signal peptide 
or a transit peptide. 

11. Fusion protein according to claim 10, 
characterized in that the signal peptide is the signal 

0 peptide of the tobacco PR-la gene. 

12. Fusion protein according to claim 11, 
characterized in that it is described by the sequence 
identifier No. 5 (SEQ ID NO 5). 

13. Chimeric gene comprising a coding 
sequence as well as heterologous regulatory elements at 
the 5' and 3' positions which can function in a host 
organism, in particular plants, characterized in that 



the coding sequence comprises at least one DNA fragment 
encoding thanatin as defined in claims 1 to 9. 

14. Chimeric gene according to claim 13, 
characterized in that the host organism is chosen from 

5 plant cells and plants. 

15. Chimeric gene according to either of 
claims 13 and 14, characterized in that it also 
comprises a selectable marker. 

16. Cloning or expression vector for the 
0 transformation of a host organism, characterized in 

that it comprises at least one replication origin and 
at least one chimeric gene as defined in claims 13 to 
15. 

17. Vector according to claim 16, 

5 characterized in that it is a virus used for the 

transformation of developed plants and containing, in 
addition, its own elements for replication and 
expression. 

18. Vector according to claim 16, 
0 characterized in that it is a plasmid. 

19. Transformed host organisms, 
characterized in that they contain an effective 
quantity of a chimeric gene according to claims 13 to 
15. 

20. Transformed host organism according to 
claim 19, characterized in that it consists of plant 
cells or plants. 
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21. Transformed host organism according to 
claim 20, characterized in that it is a plant 
containing transformed cells. 

22. Host organism according to claim 21, 
characterized in that the plant is regenerated from 
transformed cells. 

23. Transformed plant cell, characterized in 
that it contains a nucleic acid fragment according to 
claims 1 to 9, or a chimeric gene according to claims 
13 to 15. 

24. Transformed plant resistant to diseases, 
characterized in that it comprises at least one 
transformed plant cell according to claim 23. 

25. Transformed plant according to claim 24, 
characterized in that it is resistant to diseases 
caused by Cercospora, in particular Cercospora 
beticola, Cladosporium, in particular Cladosporlum 
herbarum, Fusarium, in particular Fusarium culmorum or 
FusariuiTi graminearum or by Phytophthora, in particular 
Phytophthora cinnamomi . 

26. Disease-resistant transformed plant, 
characterized in that it is derived from the 
cultivation and/or crossing of plants according to 
either of claims 24 and 25. 

27. Seeds of transformed plants according to 
one of claims 24 to 26. 

28. Method of transforming host organisms, 
in particular plant cells or plants, characterized in 
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that at least one nucleic acid fragment according to 
claims 1 to 9 or a chimeric gene according to one of 
claims 13 to 15 is inserted into the said host 
organism. 

5 29. Method of transforming plants to make 

them resistant to fungal or bacterial diseases, 
characterized in that at least one nucleic acid 
fragment according to claims 1 to 9 or a chimeric gene 
according to claims 13 to 15 is inserted into the 
10 plant. 

30. Method according to either of claims 28 
and 29, characterized in that the chimeric gene is 
inserted by means of a vector according to one of 
claims 16 to 18. 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(iii) NUMBER OF SEQUENCES: 13 

(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs ' 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
{ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:!. .33 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
ATC ATC TAC TGC AAC AGG AGG ACT GGT AAG TGC 
He He Tyr Cys Asn Arg Arg Thr Gly Lys Cys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 2; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE:. 

(A) NAME/KEY: CDS 
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(B) LOCATION:!. .63 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GGT TCC AAG AAG CCA GTG CCA ATC ATC TAC TGC AAC AGG AGG ACT GGT 
Gly Ser Lys Lys Pro Val Pro He He Tyr Cys Asn Arg Arg Thr Gly 

AAG TCG CAG AGG ATG 
Lys Cys Gin Arg Met 

20 



10 (2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 98 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
1^ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:!. .63 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



AS 



GGT TCC AAG AAG CCA GTG CCA ATC ATC TAC TGC AAC AGG AGG ACT GGT 48 
Gly Ser Lys Lys Pro Val Pro He He Tyr Cys Asn Arg Arg Thr Gly 
^5 10 15 

25 AAG TGC CAG AGG ATG TGAGCTCGGC GAGGCGAACG TGTCGACGGA TCCGG 
Lys Cys Gin Arg Met 

20 
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(2) INFORMATION FOR SEQ ID NO: 4: 
(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: double 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 12. .101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCGTCGACGC C ATG GGT TTC GTG CTT TTC TCT CAG CTT CCA TCT TTC CTT 

Met Gly Phe Val Leu Phe Ser Gin Leu Pro Ser Phe Leu 

CTT GTG TCT ACT CTT CTT CTT TTC CTT GTG ATC TCT CAC TCT TGC CGT 
Leu Val Ser Thr Leu Leu Leu Phe Leu Val He Ser His Ser Cys Arg 
15 20 25 
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GCC GGCGA 

106 

Ala 
30 



5 (2) INFORMATION FOR SEQ ID NO; 5; 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 12. .164 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCGTCGACGC C ATG GGT TTC GTG CTT TTC TCT CAG CTT CCA TCT TTC CTT 

Met Gly Phe Val Leu Phe Ser Gin Leu Pro Ser Phe Leu 

CTT GTG TCT AT^TT CTT CTT TTC CTT GTG ATC TCT CAC TCT TGC CGT 98 
20 Leu Val Ser Thr Leu Leu Leu Phe Leu Val He Ser His Ser Cys Arg 
" 20 25 

GCC GGT TCC AAG AAG CCA GTG CCA ATC ATC TAC TGC AAC AGG AGG ACT 146 
Ala Gly ser Lys Lys Pro Val Pro He He Tyr Cys Asn Arg Arg Thr 

30 -is 

" 40 45 

25 GGT AAG TGC CAG AGG ATG TGAGCTCGGC GAGGCGAACG TGTCGACGGA TCC 197 
Gly Lys Cys Gin Arg Met 

50 



50 



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleotide 

^ (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide 1" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

10 GCGTCGACGC GATGGGTTTC GTGCTTTTCT CTCAGCTTCC ATCTTTCCTT CTTGTGTCTA 60 

CTCTTCTTCT TTTCC 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc « "«svti*-k^«.< « i • 

*x»^«./ue5c « synthetic oligonucleotide 2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TCGCCGGCAC GGCAAGAGTA AGAGATCACA AGGAAAAGAA GAAGAGTAGA CACAAGAAGG 60 
AAAGATGGAA GC 

72 



25 (2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 4 base pairs 

(B) TYPE: nucleotide 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION :/desc = "synthetic oligonucleotide 3" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGTTCCAAGA AGCCAGTGCC AATCATCTAC TGCAACAGGA CG 



42 



10 



15 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc « "synthetic oligonucleotide 4" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CCGGATCCGT CGACACGTTC GCCTCGCCGA GCTCACATCC TCTGGCACTT ACCAGTCCTC 60 
CTGTTGCAGT AGATGATTGG CACTGGC 



87 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc - "synthetic oligonucleotide 5" 



10 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ACGCCCCCCT AGGGTTTAAA CGGCCAGTCA CGCCGAATTC GAGCTCGGTA CCCGGGGATC 
CTCTAGAGTC GACCTGCAGG CATGC 

85 

5 (2) INFORMATION FOR SEQ ID NO: 110: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "<5^/T.^K^♦- ^ ^ i 

v«.,ae5c synthetic oligonucleotide 6" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCCTGAACCA GGCTCGAGGG CGCGCCTTAA TTAAAAGCTT GCATGCCTGC AGGTCGACTC TACAGG60 



20 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "svni-h***. i ^ 

/uesc synthetic oligonucleotide 7« 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

25 CCGGCCAGTC AGGCCACACT TAATTAAGTT TAAarrrrrr r^nr^r^ 

MftiiAAGTT TAAACGCGGC CCCGGCGCGC CTAGGTGTGT 60 

GCTCGAGGGC CCAACCTCAG TACCTGGTTC AGG 



93 



(2) INFORMATION FOR SEQ ID NO: 13: 



8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide 8" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CCGGCCTGAA CCAGGTACTG AGGTTGGGCC CTCGAGCACA CACCTAGGCG 
GCGTTTAAAC TTAATTAAGT GTGGCCTGAC TGG 
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